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1.  Introduction 


We  consider  a  problem  of  resource  allocation  In  a  network  of  computers. 
Imagine  an  n  node  undirected  graph  In  which  the  nodes  represent  processors 
and  the  edges  communication  paths.  We  have  t  resources  which  can  be  placed 
on  various  nodes  of  the  graph.  Requests  lor  the  resources  originate  with 
equal  probability  at  any  of  a  designated  set  of  request  nodes.  Each  request 
is  serviced  by  matching  it  with  some  resource.  The  cos L  of  that  service  is 
the  shortest  distance  between  the  request  and  the  resource  wiLh  which  it  is 
matched.  The  cost  of  servicing  a  simultaneous  seL  of  requests  is  the  minimum 
total  cost  that  can  be  obtained  over  ail  possible  matchings  of  those  requests 
to  resources. 

For  example,  the  resources  might  be  available  processors  and  the  requests 
be  users  wishing  service.  The  minimum  cost  matching  gives  the  best  way  of 
assigning  processors  to  users  so  as  to  minimize  average  distance  in  the  net¬ 
work  between  the  user  and  his  processor. 

Averaging  over  all  possible  patterns  of  a  fixed  number  of  requests  at 
the  designated  request  points,  we  get  an  expected  cost  of  servicing  the 
requests.  We  are  interested  in  how  to  place  the  resources  so  as  to  minimize 
the  expected  cost  of  servicing  a  random  set  of  requests  of  a  fixed  size, 
and  we  wish  to  get  good  bounds  on  that  expected  cost. 

Our  results  to  date  are  for  the  special  but  interesting  case  with  the 
following  restrictions: 

(i)  The  graph  is  a  complete  binary  tree  with  n  leaves; 

(ii)  Resources  can  be  placed  anywhere  In  the  tree,  but  requests  ori¬ 
ginate  only  .it  leaves; 


% 
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(lit)  Requests  are  independent  random  events  uniformly  distributed 
over  the  leaves; 

(iv)  The  number  of  requests  equals  the  number  of  resources. 

2 .  Properties  of  Optimal  Placements 

An  assignment  of  t  resources  to  nodes  is  optimal  if  it  minimizes  the 
expected  cost  of  servicing  a  random  set  of  t  requests  at  the  leaves.  In¬ 
tuitively,  if  t  is  small  relative  to  n,  the  number  of  leaves  in  the  ei.ee, 
it  should  be  better  to  place  the  resources  up  toward  the  root,  whereas  if 
t  is  large  relative  to  n,  then  one  should  do  betLer  by  dividing  up  the  re¬ 
sources  equally  among  the  leaves.  In  fact,  this  is  not  strictly  true  unless 
t  is  a  power  of  2.  The  structure  of  optimal  placement  for  non-power-of-two 
numbers  of  resources  is  rich  and  complicated;  nevertheless,  an  optimal 
placement  can  be  found  quickly  by  a  straightforward  algorithm.  We  develop 

those  results  In  the  remainder  of  this  section. 

We  begin  by  looking  at  the  optimal  matching  of  a  fixed  set  of  t  requests 

to  the  t  resources.  A  matching  is  a  set  of  directed  paths  from  distinct  re¬ 
quests  to  distinct  resources.  The  directed  flow  along  an  edge  e  in  the  tree 
is  the  number  of  paths  in  the  matching  which  contain  that  edge  in  the  desig¬ 
nated  direction  (towards  or  away  from  the  root).  Because  we  are  looking  only 
at  trees,  a  necessary  and  sufficient  condition  for  a  matching  to  be  optimal 
Is  that  no  edge  have  flow  in  both  directions.  It  follows  that  the  flow  along 
any  edge  e  in  an  optimal  matching  is  determined  solely  by  the  number  of  requests 

In  the  subtree  T  which  e  connects  to  the  rest  of  the  tree,  as  shown  below: 
e 
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if  the  number  of  requests  in  T  exceeds  the  number  of  resources  in  T  , 

e  e 

ttien  the  difference  flows  through  e  ouL  of  T  .  If  the  number  of  resources 

e 

exceeds  the  number  of  requests,  then  the  difference  flows  through  e  into 
T  .  The  flow  is  zero  if  the  number  of  requests  equals  the  number  of  resources 
in  T  ^ .  Letting  s  be  Lhe  number  of  resources  and  i  the  number  of  requests 
in  T  ,  the  absolute  flow  is  defined  to  be  |s-i|.  Summing  Lhe  absolute  flow 
over  all  edges  in  the  tree  gives  the  cost  of  the  optimal  matching. 

We  now  consider  all.  Lhe  different  possible  patterns  of  requests  and 
look  at  tile  expected  absolute  flow  along  an  edge  e.  if  T  has  s  resources, 
this  is  given  by 


t 

I  ProbLexactly  i 
i=0 


requests  occur  in  T 

e 


Is-il . 


Since  the  requests  are  independent  and  uniformly  distributed,  the  probability 
of  exactly  i  requests  in  a  subtree  is  binomial ly  distributed  and  is  given 
by  ^  >  where  L  is  the  total  number  of  requests,  p^  =  (//  leaves 

in  T^)/n  fa  the  probability  that  a  given  request  will  fall  in  ,  and 
q^  =  1-p^.  Thus,  the  expected  absolute  flow  in  e  is  given  by 


for 

T  . 


f  (s)  =  Y.  (^)pi*qt  Is-il  . 
i=0Vl'  °  ° 


Let  L  be  tlu*  set  ol  edges  in  the 
e 

each  edge  d  in  f  ,  let  s  .  be 

e  u 

The  total  expected  absolute  flow  in 


Fh  .  'd  Ud 

e  J  i  t 

e 


subtree  T  ,  including 

the  number  of  resources 

T  is  del  tiled  to  be 
e 


e  itsej  1  . 
ill  the  subtree 


d.  f 

e 


where  £  is  the  vector  ls^) 


Note  that  F  Includes  the  1  low  along  the  edge  leading  out  of  the  root 
c 

ot  the  subtree  as  well  as  the  tlows  on  all  the  edges  internal  to  the 
subt lee . 

The  definitions  lot  i  and  K  make  perfectly  good  mathematical 

e 

sense  for  arbltraiy  real  values  ol  s.  This  cor i es ponds  to  the  situation 
in  which  resources  are  continuously  divlsable,  but  the  requests  neverthe¬ 
less  are  integral.  We  consider  the  continuous  case  tor  technical  conveni¬ 
ence  . 

Let  the  optimal  I  1 ow  In  the  continuous  case  be  given  by 

K^>l(s)  -  min  iFj,  (js)  I  s  -  ,  and  each  s^  in  s  is  a  non-negative 

e  e 

real  luimhci  1 

For  the  inLeger  ease,  we  have 

i\!1)l(s)  =  min  (F  (s)  I  s  =  Ls,  and  each  s.  in  s  is  in  W). 

L  L  ~  d 

e  e 

F^1  is  delined  only  lor  s>  M. 
t 

e 

Theorem  I.  Kaeh  ol  the  functions  f  ,  F,,  ,  and  f!^L  is  convex, 

-  v  h  h 

e  e 

piecewise  linear,  with  vertices  occurlng  at  integer  values  of  s. 

Theorem  .  F^*>l(s)  =  F,u*,L(s)  lor  all  S'  H . 

-  !'.  r. 

v  c 

This  tells  us  that  the  I reedom  to  subdivide  resources  cannot  result  in  a 
lower  cost  placement  than  can  be  achieved  by  placing  whole  resources. 

The  lair  sli a i e  placement  ot  t  resources,  which  plays  a  special  role 
In  our  development,  puts  t/u  resource  units  on  each  leal  and  no  resources 
on  any  Internal  node.  fills  placement  has  l  lie  property  that  the  fraction  of 
resiiui  ces  on  any  subtree  is  the  same  as  the  I  (action  ol  leaves  which  lhat 
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subtrce  contains.  One  might  conjecture  that  Lhe  iair  share  placement  is 
optimal,  and  indeed  it  is  when  t/n  is  an  integer,  but  there  are  cases 
where  it  is  demonstrably  non-optimal.  Nevertheless,  it  is  always  close 
to  optimal,  a  fact  that  is  crucial  to  the  correctness  of  our  algoritluu. 

Theorem  J.  There  exists  an  optimal  placement  (of  whole  resources) 

such  that  (i)  for  every  pair  of  brother  subtrees  T_  and  , ,  the 

numbers  of  resources  on  T  and  T  ,  differ  by  at  most  1; 

e  e 

(ii)  the  number  of  resources  on  each  subtree  T  differs  from 

e 

Lhe  fair  share  number  for  that  subtree  by  less  than  one. 

The  proof  of  this  theorem  rests  heavily  on  Theorems  1  and  2  and  uses 
the  little-known  fact  that  the  mean  and  the  median  of  the  binomial  distri¬ 
bution  differ  by  an  amount  less  than  °ne  I  1,1 1 . 

Corollary.  if  Lhe  number  of  resources  is  a  multiple  of  the  number  of 
leaves,  then  the  fair  share  placement  is  optimal. 

From  Theorem  },  we  get  a  time  0(iog  n)  dynamic-programming  algorithm 
for  finding  an  optimal  placement  of  t  resources  on  a  complete  binary  tree 
of  n  leaves.  The  algorithm  proceeds  by  finding,  for  each  L  =  0,1,..., 
log  n  the  cost  of  the  optimal  placement  of  boLh  Lltl  and  TlH  resources 
on  a  subtree  of  2L  leaves,  where  h  =  t*2  /n  is  the  fair  share  number  for 
that  subtree.  This  takes  only  a  constant  amount  of  time  given  the  same 
information  for  1.-1;  hence  Lhe  total  time  Is  l)(log  n) .  A  "description" 
ot  the  actual  placement  can  also  be  found  ill  time  0(log  n) ,  but  of  course 
to  produce  the  1'ul  ly-writ ten-out  placement  takes  time  0(n)  .  This  algorithm 


establishes: 
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Theorem  4.  The  cost  of  an  optimal  placement  of  I  resources  on  a 
complete  binary  tree  of  n  leaves,  and  a  "description"  of  that  placement, 
can  be  computed  in  time  O(log  n)  on  a  RAM. 

We  conclude  this  section  with  an  example  showing  that  the  fair  share 
placement  is  not  always  optimal  and  illustrating  some  of  the  structure  of 
optimal  placements.  An  optimal  placement  of  11  resources  on  a  16-leaf 
complete  binary  tree  is  shown  below: 


0 

/  \ 

0  111 

/\  A  /v  /v 

0  0  0  0  0  0  0  0 

A  A  /V  A  A  /V  /V  A 

0  10  10  10  10  10  10  10  1 


The  expected  cost  of  this  placement  is  24.902...  .  The  fair  share  place¬ 
ment,  which  places  0.b875  resources  on  each  leaf,  has  expected  cost  25.404. 

J.  Bounds  on  the  Costs  of  Optimal  Layouts 


In  this  section,  we  show  that  the  optimal  placement  of  t  resources 
results  in  an  expected  cost  of  at  most 

c  *  t  +  2  log  — 

where  c  is  a  "small"  constant.  Hence,  the  average  cost  per  request  is 
constant  when  the  number  of  resources  is  at  least  proportional  to  the 
number  of  leaves  in  the  tree. 

We  begin  by  defining  a  s imp  1 e  placement  to  be  the  one  which  puts  t/2 
resources  on  each  node  at  level  L  ( 1 rom  the  root),  where  1.  =  mlnflog  n. 
Flog  tl)  .  Note  that  when  t^n/2,  the  simple  and  iair  share  placements  are 
the  Siime.  Also,  no  node  La  a  simple  placement  has  a  non-zero  number  ot 


resources 
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Theorem  5.  Cost  of  the  simple  placement  ■  cost  of  the  fair  share 
placement  s  c*t  +  2  log  •—  for  some  constant  c. 

Theorem  b.  F°^>t(t)  ^  cost  of  simple  placement  i  2L  -  1  +  F°^>t(t) 
where  L  =  mini  log  a,  ("leg  t*l } . 

Theorem  7.  If  t  is  a  power  of  2,  the  simple  placement  is  optimal. 

4 .  Discuss  ion 

A  related  problem,  which  formed  the  original  motivation  for  this  study, 
was  to  find  and  analyze  algorithms  for  a  distributed  "ticket"  system  such 
as  might  be  used  to  sell  tickets  to  a  sporting  event.  The  tickets  are  as¬ 
sumed  to  reside  at  various  nodes  in  the  network,  and  the  distance  from  a 
request  to  the  ticket  which  is  eventually  used  to  grant  the  request  is  some 
measure  of  the  waiting  time  to  service  that  request.  The  difference  between 
that  problem  and  the  one  in  this  paper  is  that  the  requests  come  in  sequenti¬ 
ally  and  it  is  not  possible  to  achieve  an  optimal  matching  of  requests  to 
tickets  without  knowing  the  locations  of  future  requests.  We  are  currently 
trying  to  extend  our  results  to  this  case  of  serial  requests  for  resources. 

It  seems  clear  that  any  static  placement  of  tickets  on  nodes  will  lead 
to  deteriorating  performance  as  more  and  more  tickets  are  sold.  Thus,  even 
though  the  average  waiting  time  might  be  acceptably  small,  the  time  might 
nevertheless  grow  unacceptably  large  when  only  few  tickets  are  left.  To  help 
alleviate  this  problem,  one  might  consider  algorithms  which  dynamically  move 
tickets  around  to  try  to  improve  the  future  expected  waiting  time,  analogous 
to  the  various  balancing  schemes  for  binary  search  trees.  We  have  an  algorithm 
which  does  such  rebalancing,  but  it  is  still  unclear  to  what  extent,  it  any, 


the  rebalancing  actually  helps. 
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Finally,  it  would  bo  ulco  to  remove  some  ol  the  Leelmical  restrictions 
we  placed  on  the  problem  and  generalize  the  results  to  cases  of  arbitrary 
graphs ,  arbitrary  subsets  oi  nodes  for  location  of  requests  and  placement 
ot  resources,  numbers  of  requests  different  from  number  of  resources,  and 
non-uniform  distributions  on  the  probability  of  a  request. 
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